IEICE global.ieice.org Site

Author Search Result

[Author] Yang LI(82hit)

41-60hit(82hit)

Analysis of a New High-Speed DC Switch Repulsion Mechanism
Yi WU Hailong HE Zhengyong HU Fei YANG Mingzhe RONG Yang LI

PAPER

Vol:
E94-C No:9
Page(s):
1409-1415
This paper focuses on the research of a new high-speed DC switch repulsion mechanism with experimental and simulation methods. Multi-physical equations reflecting the transient electromagnetic field, electric circuit, mechanical motion and material deformation are coupled in the calculation. For the reason of accuracy, skin effect and the proximity effect caused by the current in the coil are also taken into account. According to the simulation results, which indicate several key parameters severely affecting the mechanism speed, a high-speed DC switch repulsion mechanism is developed. By the test of mechanism motion, its average speed can be up to 8.4 m/s and its mechanism response time is 250 µs, which verifies the simulation results. Furthermore, during high speed motion the stress on the metal plate and moving contact is also discussed. It is noticed that the influence of the material deformation on the mechanical motion is very important.
Construction of Resilient Boolean and Vectorial Boolean Functions with High Nonlinearity
Luyang LI Dong ZHENG Qinglan ZHAO

LETTER-Cryptography and Information Security

Vol:
E102-A No:10
Page(s):
1397-1401
Boolean functions and vectorial Boolean functions are the most important components of stream ciphers. Their cryptographic properties are crucial to the security of the underlying ciphers. And how to construct such functions with good cryptographic properties is a nice problem that worth to be investigated. In this paper, using two small nonlinear functions with t-1 resiliency, we provide a method on constructing t-resilient n variables Boolean functions with strictly almost optimal nonlinearity >2n-1-2n/2 and optimal algebraic degree n-t-1. Based on the method, we give another construction so that a large class of resilient vectorial Boolean functions can be obtained. It is shown that the vectorial Boolean functions also have strictly almost optimal nonlinearity and optimal algebraic degree.
Two Adaptive Energy Detectors for Cognitive Radio Systems
Siyang LIU Gang XIE Zhongshan ZHANG Yuanan LIU

LETTER-Wireless Communication Technologies

Vol:
E92-B No:6
Page(s):
2332-2335
Two adaptive energy detectors are proposed for cognitive radio systems to detect the primary users. Unlike the conventional energy detector (CED) where a decision is made after receiving all samples, our detectors make a decision with the sequential arrival of samples. Hence, the sample size of the proposed detectors is adaptive. Simulation results show that for a desired performance, the average sample size of the proposed detectors is much less than that of the CED. Therefore, they are more agile than the CED.
Rectifying Transformation Networks for Transformation-Invariant Representations with Power Law
Chunxiao FAN Yang LI Lei TIAN Yong LI

LETTER-Image Recognition, Computer Vision

Pubricized:
2018/12/04
Vol:
E102-D No:3
Page(s):
675-679
This letter proposes a representation learning framework of convolutional neural networks (Convnets) that aims to rectify and improve the feature representations learned by existing transformation-invariant methods. The existing methods usually encode feature representations invariant to a wide range of spatial transformations by augmenting input images or transforming intermediate layers. Unfortunately, simply transforming the intermediate feature maps may lead to unpredictable representations that are ineffective in describing the transformed features of the inputs. The reason is that the operations of convolution and geometric transformation are not exchangeable in most cases and so exchanging the two operations will yield the transformation error. The error may potentially harm the performance of the classification networks. Motivated by the fractal statistics of natural images, this letter proposes a rectifying transformation operator to minimize the error. The proposed method is differentiable and can be inserted into the convolutional architecture without making any modification to the optimization algorithm. We show that the rectified feature representations result in better classification performance on two benchmarks.
Learning a Similarity Constrained Discriminative Kernel Dictionary from Concatenated Low-Rank Features for Action Recognition
Shijian HUANG Junyong YE Tongqing WANG Li JIANG Changyuan XING Yang LI

LETTER-Pattern Recognition

Pubricized:
2015/11/16
Vol:
E99-D No:2
Page(s):
541-544
Traditional low-rank feature lose the temporal information among action sequence. To obtain the temporal information, we split an action video into multiple action subsequences and concatenate all the low-rank features of subsequences according to their time order. Then we recognize actions by learning a novel dictionary model from concatenated low-rank features. However, traditional dictionary learning models usually neglect the similarity among the coding coefficients and have bad performance in dealing with non-linearly separable data. To overcome these shortcomings, we present a novel similarity constrained discriminative kernel dictionary learning for action recognition. The effectiveness of the proposed method is verified on three benchmarks, and the experimental results show the promising results of our method for action recognition.
Structural Analysis of Nonbinary Cyclic and Quasi-Cyclic LDPC Codes with α-Multiplied Parity-Check Matrices
Haiyang LIU Hao ZHANG Lianrong MA Lingjun KONG

LETTER-Coding Theory

Pubricized:
2020/05/12
Vol:
E103-A No:11
Page(s):
1299-1303
In this letter, the structural analysis of nonbinary cyclic and quasi-cyclic (QC) low-density parity-check (LDPC) codes with α-multiplied parity-check matrices (PCMs) is concerned. Using analytical methods, several structural parameters of nonbinary cyclic and QC LDPC codes with α-multiplied PCMs are determined. In particular, some classes of nonbinary LDPC codes constructed from finite fields and finite geometries are shown to have good minimum and stopping distances properties, which may explain to some extent their wonderful decoding performances.
Silicon Photonics Research in Hong Kong: Microresonator Devices and Optical Nonlinearities
Andrew W. POON Linjie ZHOU Fang XU Chao LI Hui CHEN Tak-Keung LIANG Yang LIU Hon K. TSANG

INVITED PAPER

Vol:
E91-C No:2
Page(s):
156-166
In this review paper we showcase recent activities on silicon photonics science and technology research in Hong Kong regarding two important topical areas--microresonator devices and optical nonlinearities. Our work on silicon microresonator filters, switches and modulators have shown promise for the nascent development of on-chip optoelectronic signal processing systems, while our studies on optical nonlinearities have contributed to basic understanding of silicon-based optically-pumped light sources and helium-implanted detectors. Here, we review our various passive and electro-optic active microresonator devices including (i) cascaded microring resonator cross-connect filters, (ii) NRZ-to-PRZ data format converters using a microring resonator notch filter, (iii) GHz-speed carrier-injection-based microring resonator modulators and 0.5-GHz-speed carrier-injection-based microdisk resonator modulators, and (iv) electrically reconfigurable microring resonator add-drop filters and electro-optic logic switches using interferometric resonance control. On the nonlinear waveguide front, we review the main nonlinear optical effects in silicon, and show that even at fairly modest average powers two-photon absorption and the accompanied free-carrier linear absorption could lead to optical limiting and a dramatic reduction in the effective lengths of nonlinear devices.
Fast Montgomery Modular Multiplication and Squaring on Embedded Processors
Yang LI Jinlin WANG Xuewen ZENG Xiaozhou YE

PAPER-Fundamental Theories for Communications

Pubricized:
2016/12/06
Vol:
E100-B No:5
Page(s):
680-690
Montgomery modular multiplication is one of the most efficient algorithms for modular multiplication of large integers. On resource-constraint embedded processors, memory-access operations play an important role as arithmetic operations in the modular multiplication. To improve the efficiency of Montgomery modular multiplication on embedded processors, this paper concentrates on reducing the memory-access operations through adding a few working registers. We first revisit previous popular Montgomery modular multiplication algorithms, and then present improved algorithms for Montgomery modular multiplication and squaring for arbitrary prime fields. The algorithms adopt the general ideas of hybrid multiplication algorithm proposed by Gura and lazy doubling algorithm proposed by Lee. By careful optimization and redesign, we propose novel implementations for Montgomery multiplication and squaring called coarsely integrated product and operand hybrid scanning algorithm (CIPOHS) and coarsely integrated lazy doubling algorithm (CILD). Then, we implement the algorithms on general MIPS64 processor and OCTEON CN6645 processor equipped with specific multiply-add instructions. Experiments show that CIPOHS and CILD offer the best performance both on the general MIPS64 and OCTEON CN6645 processors. But the proposed algorithms have obvious advantages for the processors with specific multiply-add instructions such as OCTEON CN6645. When the modulus is 2048 bits, the CIPOHS and CILD outperform the CIOS algorithm by a factor of 47% and 58%, respectively.
TRLMS: Two-Stage Resource Scheduling Algorithm for Cloud Based Live Media Streaming System
Wei WEI Yang LIU Yuhong ZHANG

LETTER

Vol:
E97-D No:7
Page(s):
1731-1734
This letter proposes an efficient Two-stage Resource scheduling algorithm for cloud based Live Media Streaming system (TRLMS). It transforms the cloud-based resource scheduling problem to a min-cost flow problem in a graph, and solves it by an improved Successive Short Path (SSP) algorithm. Simulation results show that TRLMS can enhance user demand satisfaction by 17.1% than mean-based method, and its time complexity is much lower than original SSP algorithm.
A Power-Saving Technique for the OSGi Platform
Kuo-Yi CHEN Chin-Yang LIN Tien-Yan MA Ting-Wei HOU

PAPER-Software System

Vol:
E95-D No:5
Page(s):
1417-1426
With more digital home appliances and network devices having OSGi as the software management platform, the power-saving capability of the OSGi platform has become a critical issue. This paper is aimed at improving the power-efficiency of the OSGi platform, i.e. reducing the energy consumption with minimum performance degradation. The key to this study is an efficient power-saving technique which exploits the runtime information already available in a Java virtual machine (JVM), the base software of the OSGi platform, to best determine the timing of performing DVFS (Dynamic Voltage and Frequency Scaling). This, technically, involves a phase detection scheme that identifies the memory phase of the OSGi-enabled device/server in a correct and almost effortless way. The overhead of the power-saving procedure is thus minimized, and the system performance is well maintained. We have implemented and evaluated the proposed power-saving approach on an OSGi server, where the Apache Felix OSGi implementation and the DaCapo benchmarks were applied. The results show that this approach can achieve real power-efficiency for the OSGi platform, in which the power consumption is significantly reduced and the performance remains highly competitive, compared with the other power-saving techniques.
Full Diversity Full Rate Cyclotomic Orthogonal Space-Time Block Codes for MIMO Wireless Systems
Hua JIANG Kanglian ZHAO Yang LI Sidan DU

LETTER-Wireless Communication Technologies

Vol:
E95-B No:10
Page(s):
3349-3352
In this letter we design a new family of space-time block codes (STBC) for multi-input multi-output (MIMO) systems. The complex orthogonal STBC achieves full diversity and full transmission rate with fast maximum-likelihood decoding when only two transmit antennas are employed. By combining the Alamouti STBC and the multidimensional signal constellation rotation based on the cyclotomic number field, we construct cyclotomic orthogonal space-time block codes (COSTBCs) which can achieve full diversity and full rate for multiple transmit antennas. Theoretical analysis and simulation results demonstrate excellent performance of the proposed codes, while the decoding complexity is further reduced.
DFAM-DETR: Deformable Feature Based Attention Mechanism DETR on Slender Object Detection
Feng WEN Mei WANG Xiaojie HU

PAPER-Image Recognition, Computer Vision

Pubricized:
2022/12/09
Vol:
E106-D No:3
Page(s):
401-409
Object detection is one of the most important aspects of computer vision, and the use of CNNs for object detection has yielded substantial results in a variety of fields. However, due to the fixed sampling in standard convolution layers, it restricts receptive fields to fixed locations and limits CNNs in geometric transformations. This leads to poor performance of CNNs for slender object detection. In order to achieve better slender object detection accuracy and efficiency, this proposed detector DFAM-DETR not only can adjust the sampling points adaptively, but also enhance the ability to focus on slender object features and extract essential information from global to local on the image through an attention mechanism. This study uses slender objects images from MS-COCO dataset. The experimental results show that DFAM-DETR achieves excellent detection performance on slender objects compared to CNN and transformer-based detectors.
More on Incorrigible Sets of Binary Linear Codes
Lingjun KONG Haiyang LIU Lianrong MA

LETTER-Coding Theory

Pubricized:
2022/10/31
Vol:
E106-A No:5
Page(s):
863-867
This letter is concerned with incorrigible sets of binary linear codes. For a given binary linear code C, we represent the numbers of incorrigible sets of size up to ⌈3/2d - 1⌉ using the weight enumerator of C, where d is the minimum distance of C. In addition, we determine the incorrigible set enumerators of binary Golay codes G23 and G24 through combinatorial methods.
A Multitask Learning Approach Based on Cascaded Attention Network and Self-Adaption Loss for Speech Emotion Recognition
Yang LIU Yuqi XIA Haoqin SUN Xiaolei MENG Jianxiong BAI Wenbo GUAN Zhen ZHAO Yongwei LI

PAPER-Speech and Hearing

Pubricized:
2022/12/08
Vol:
E106-A No:6
Page(s):
876-885
Speech emotion recognition (SER) has been a complex and difficult task for a long time due to emotional complexity. In this paper, we propose a multitask deep learning approach based on cascaded attention network and self-adaption loss for SER. First, non-personalized features are extracted to represent the process of emotion change while reducing external variables' influence. Second, to highlight salient speech emotion features, a cascade attention network is proposed, where spatial temporal attention can effectively locate the regions of speech that express emotion, while self-attention reduces the dependence on external information. Finally, the influence brought by the differences in gender and human perception of external information is alleviated by using a multitask learning strategy, where a self-adaption loss is introduced to determine the weights of different tasks dynamically. Experimental results on IEMOCAP dataset demonstrate that our method gains an absolute improvement of 1.97% and 0.91% over state-of-the-art strategies in terms of weighted accuracy (WA) and unweighted accuracy (UA), respectively.
GazeFollowTR: A Method of Gaze Following with Reborn Mechanism
Jingzhao DAI Ming LI Xuejiao HU Yang LI Sidan DU

PAPER-Vision

Pubricized:
2022/11/30
Vol:
E106-A No:6
Page(s):
938-946
Gaze following is the task of estimating where an observer is looking inside a scene. Both the observer and scene information must be learned to determine the gaze directions and gaze points. Recently, many existing works have only focused on scenes or observers. In contrast, revealed frameworks for gaze following are limited. In this paper, a gaze following method using a hybrid transformer is proposed. Based on the conventional method (GazeFollow), we conduct three developments. First, a hybrid transformer is applied for learning head images and gaze positions. Second, the pinball loss function is utilized to control the gaze point error. Finally, a novel ReLU layer with the reborn mechanism (reborn ReLU) is conducted to replace traditional ReLU layers in different network stages. To test the performance of our developments, we train our developed framework with the DL Gaze dataset and evaluate the model on our collected set. Through our experimental results, it can be proven that our framework can achieve outperformance over our referred methods.
MCGCN: Multi-Correlation Graph Convolutional Network for Pedestrian Attribute Recognition
Yang YU Longlong LIU Ye ZHU Shixin CEN Yang LI

PAPER-Image Recognition, Computer Vision

Pubricized:
2023/11/29
Vol:
E107-D No:3
Page(s):
400-410
Pedestrian attribute recognition (PAR) aims to recognize a series of a person's semantic attributes, e.g., age, gender, which plays an important role in video surveillance. This paper proposes a multi-correlation graph convolutional network named MCGCN for PAR, which includes a semantic graph, visual graph, and synthesis graph. We construct a semantic graph by using attribute features with semantic constraints. A graph convolution is employed, based on prior knowledge of the dataset, to learn the semantic correlation. 2D features are projected onto visual graph nodes and each node corresponds to the feature region of each attribute group. Graph convolution is then utilized to learn regional correlation. The visual graph nodes are connected to the semantic graph nodes to form a synthesis graph. In the synthesis graph, regional and semantic correlation are embedded into each other through inter-graph edges, to guide each other's learning and to update the visual and semantic graph, thereby constructing semantic and regional correlation. On this basis, we use a better loss weighting strategy, the suit_polyloss, to address the imbalance of pedestrian attribute datasets. Experiments on three benchmark datasets show that the proposed approach achieves superior recognition performance compared to existing technologies, and achieves state-of-the-art performance.
Deep Discriminative Supervised Hashing via Siamese Network
Yang LI Zhuang MIAO Jiabao WANG Yafei ZHANG Hang LI

LETTER-Artificial Intelligence, Data Mining

Pubricized:
2017/09/12
Vol:
E100-D No:12
Page(s):
3036-3040
The latest deep hashing methods perform hash codes learning and image feature learning simultaneously by using pairwise or triplet labels. However, generating all possible pairwise or triplet labels from the training dataset can quickly become intractable, where the majority of those samples may produce small costs, resulting in slow convergence. In this letter, we propose a novel deep discriminative supervised hashing method, called DDSH, which directly learns hash codes based on a new combined loss function. Compared to previous methods, our method can take full advantages of the annotated data in terms of pairwise similarity and image identities. Extensive experiments on standard benchmarks demonstrate that our method preserves the instance-level similarity and outperforms state-of-the-art deep hashing methods in the image retrieval application. Remarkably, our 16-bits binary representation can surpass the performance of existing 48-bits binary representation, which demonstrates that our method can effectively improve the speed and precision of large scale image retrieval systems.
Further Results on the Minimum and Stopping Distances of Full-Length RS-LDPC Codes
Haiyang LIU Hao ZHANG Lianrong MA

LETTER-Coding Theory

Vol:
E100-A No:2
Page(s):
738-742
Based on the codewords of the [q,2,q-1] extended Reed-Solomon (RS) code over the finite field Fq, we can construct a regular binary γq×q2 matrix H(γ,q), where q is a power of 2 and γ≤q. The matrix H(γ,q) defines a regular low-density parity-check (LDPC) code C(γ,q), called a full-length RS-LDPC code. Using some analytical methods, we completely determine the values of s(H(4,q)), s(H(5,q)), and d(C(5,q)) in this letter, where s(H(γ,q)) and d(C(γ,q)) are the stopping distance of H(γ,q) and the minimum distance of C(γ,q), respectively.
Confidence Measure Based on Context Consistency Using Word Occurrence Probability and Topic Adaptation for Spoken Term Detection
Haiyang LI Tieran ZHENG Guibin ZHENG Jiqing HAN

PAPER-Speech and Hearing

Vol:
E97-D No:3
Page(s):
554-561
In this paper, we propose a novel confidence measure to improve the performance of spoken term detection (STD). The proposed confidence measure is based on the context consistency between a hypothesized word and its context in a word lattice. The main contribution of this paper is to compute the context consistency by considering the uncertainty in the results of speech recognition and the effect of topic. To measure the uncertainty of the context, we employ the word occurrence probability, which is obtained through combining the overlapping hypotheses in a word posterior lattice. To handle the effect of topic, we propose a method of topic adaptation. The adaptation method firstly classifies the spoken document according to the topics and then computes the context consistency of the hypothesized word with the topic-specific measure of semantic similarity. Additionally, we apply the topic-specific measure of semantic similarity by two means, and they are performed respectively with the information of the top-1 topic and the mixture of all topics according to topic classification. The experiments conducted on the Hub-4NE Mandarin database show that both the occurrence probability of context word and the topic adaptation are effective for the confidence measure of STD. The proposed confidence measure performs better compared with the one ignoring the uncertainty of the context or the one using a non-topic method.
E-Band 65nm CMOS Low-Noise Amplifier Design Using Gain-Boost Technique
Kosuke KATAYAMA Mizuki MOTOYOSHI Kyoya TAKANO Chen Yang LI Shuhei AMAKAWA Minoru FUJISHIMA

PAPER

Vol:
E97-C No:6
Page(s):
476-485
E-band communication is allocated to the frequency bands of 71-76 and 81-86GHz. Radio-frequency (RF) front-end components for E-band communication have been realized using compound semiconductor technology. To realize a CMOS LNA for E-band communication, we propose a gain-boosted cascode amplifier (GBCA) stage that simultaneously provides high gain and stability. Designing an LNA from scratch requires considerable time because the tuning of matching networks with consideration of the parasitic elements is complicated. In this paper, we model the characteristics of devices including the effects of their parasitic elements. Using these models, an optimizer can estimate the characteristic of a designed LNA precisely without electromagnetic simulations and gives us the design values of an LNA when the layout constraint is ignored. Starting from the values, a four-stage LNA with a GBCA stage is designed very easily even though the layout constraint is considered and fabricated by a 65nm LP CMOS process. The fabricated LNA is measured, and it is confirmed that it achieves 18.5GHz bandwidth and over 24.3dB gain with 50.6mW power consumption. This is the first LNA to achieve a gain bandwidth of over 300GHz in the E-band among the LNAs utilizing any kind of semiconductor technologies. In this paper, we have proved that CMOS technology, which is suitable for baseband and digital circuitry, is applicable to a communication system covering the entire E-band.

41-60hit(82hit)

Author Search Result

[Author] Yang LI(82hit)

Analysis of a New High-Speed DC Switch Repulsion Mechanism

Construction of Resilient Boolean and Vectorial Boolean Functions with High Nonlinearity

Two Adaptive Energy Detectors for Cognitive Radio Systems

Rectifying Transformation Networks for Transformation-Invariant Representations with Power Law

Learning a Similarity Constrained Discriminative Kernel Dictionary from Concatenated Low-Rank Features for Action Recognition

Structural Analysis of Nonbinary Cyclic and Quasi-Cyclic LDPC Codes with α-Multiplied Parity-Check Matrices

Silicon Photonics Research in Hong Kong: Microresonator Devices and Optical Nonlinearities

Fast Montgomery Modular Multiplication and Squaring on Embedded Processors

TRLMS: Two-Stage Resource Scheduling Algorithm for Cloud Based Live Media Streaming System

A Power-Saving Technique for the OSGi Platform

Full Diversity Full Rate Cyclotomic Orthogonal Space-Time Block Codes for MIMO Wireless Systems

DFAM-DETR: Deformable Feature Based Attention Mechanism DETR on Slender Object Detection

More on Incorrigible Sets of Binary Linear Codes

A Multitask Learning Approach Based on Cascaded Attention Network and Self-Adaption Loss for Speech Emotion Recognition

GazeFollowTR: A Method of Gaze Following with Reborn Mechanism

MCGCN: Multi-Correlation Graph Convolutional Network for Pedestrian Attribute Recognition

Deep Discriminative Supervised Hashing via Siamese Network

Further Results on the Minimum and Stopping Distances of Full-Length RS-LDPC Codes

Confidence Measure Based on Context Consistency Using Word Occurrence Probability and Topic Adaptation for Spoken Term Detection

E-Band 65nm CMOS Low-Noise Amplifier Design Using Gain-Boost Technique

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles